A General Framework for Bounding Approximate Dynamic Programming Schemes

نویسندگان

چکیده

For years, there has been interest in approximation methods for solving dynamic programming problems, because of the inherent complexity computing optimal solutions characterized by Bellman's principle optimality. A wide range approximate (ADP) now exists. It is great to guarantee that performance an ADP scheme be at least some known fraction, say ?, optimal. This letter introduces a general approach bounding methods, this sense, stochastic setting. The based on new results greedy string optimization where one choose (ordered set) actions maximize objective function. technique inspired submodularity theory, but not required establishing bounds. Instead, quantifying certain notions curvature functions; smaller curvatures better bound. key insight any surrogate function coincides its solution and value with those original control problem. then yields mentioned above, determine ? depend specific ADP.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approximate Dynamic Programming in Transportation and Logistics: A Unified Framework

Deterministic optimization has enjoyed a rich place in transportation and logistics, where it represents a mature field with established modeling and algorithmic strategies. By contrast, sequential stochastic optimization models (dynamic programs) have been plagued by the lack of a common modeling framework, and by algorithmic strategies that just do not seem to scale to real-world problems in ...

متن کامل

A General Framework for Dynamic Programming Equation and Time Consistency

Time consistency is an important property for a sequence of stochastic optimization problem, especially with regard to risk attitudes. It is known that a sequence of problem is time-consistent if a Dynamic Programming equation holds. In this paper we present a general framework for Dynamic Programming where general aggregators in time and uncertainties are considered instead of the expectation ...

متن کامل

ExpAX: A Framework for Automating Approximate Programming

We present ExpAX, a framework for automating approxi-mate programming. ExpAX consists of these three compo-nents: (1) a programming model based on a new kind of pro-gram specification, which we refer to as error expectations.Our programming model enables programmers to implicitlyrelax the accuracy constraints without explicitly marking op-erations as approximate; (2) an ...

متن کامل

Recursion Schemes for Dynamic Programming

Dynamic programming is an algorithm design technique, which allows to improve efficiency by avoiding re-computation of identical subtasks. We present a new recursion combinator, dynamorphism, which captures the dynamic programming recursion pattern with memoization and identify some simple conditions when functions defined by structured general recursion can be redefined as a dynamorphism. The ...

متن کامل

Expectation-Oriented Framework for Automating Approximate Programming

We describe ExpAX, a framework for automating approximate programming based on programmer-specified error expectations. Three components constitute ExpAX: (1) a programming model based on a new kind of program specification, which we refer to as expectations. Our programming model enables programmers to implicitly relax the accuracy constraints without explicitly marking operations approximate;...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Control Systems Letters

سال: 2021

ISSN: ['2475-1456']

DOI: https://doi.org/10.1109/lcsys.2020.3003477